Extracting topic-sensitive content from textual documents—A hybrid topic model approach
Similar resources
Extracting Topic Maps from Web browsing histories
In this paper, we propose a clustering method to extract a Topic Map from the Web browsing history. Our method is based on traditional agglomerative clustering, constrained by the Web structure and the weight of link relations. The Topic Map shows a 2D-visualized overview graph of the Web browsing history, and the relations between the topics that are gathered by a user and extracted fr...
A Hybrid Neural Network-Latent Topic Model
This paper introduces a hybrid model that combines a neural network with a latent topic model. The neural network provides a low-dimensional embedding for the input data, whose subsequent distribution is captured by the topic model. The neural network thus acts as a trainable feature extractor while the topic model captures the group structure of the data. Following an initial pretraining phase ...
A review of text mining approaches and their function in discovering and extracting a topic
Background and aim: Four text mining methods are examined, with a focus on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature on text mining and topic modeling. Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...
Topic Modelling and Event Identification from Twitter Textual Data
The tremendous growth of social media content on the Internet has inspired the development of text analytics to understand and solve real-life problems. Leveraging statistical topic modelling helps researchers and practitioners better comprehend textual content and provides useful information for further analysis. Statistical topic modelling becomes especially important when...
Topic Segmentation with a Structured Topic Model
We present a new hierarchical Bayesian model for unsupervised topic segmentation. This new model integrates a point-wise boundary sampling algorithm used in Bayesian segmentation into a structured topic model that can capture a simple hierarchical topic structure latent in documents. We develop an MCMC inference algorithm to split/merge segment(s). Experimental results show that our model outpe...
Journal
Journal title: Engineering Applications of Artificial Intelligence
Year: 2018
ISSN: 0952-1976
DOI: 10.1016/j.engappai.2017.12.010